

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Liu, Bo, Guertler, Leon, Yu, Simon, Liu, Zichen, Qi, Penghui, Balcells, Daniel, Liu, Mickel, Tan, Cheston, Shi, Weiyan, Lin, Min, Lee, Wee Sun, Jaques, Natasha

arXiv.org Artificial Intelligence

Recent advances in reinforcement learning have shown that language models can develop sophisticated reasoning through training on tasks with verifiable rewards, but these approaches depend on human-curated problem-answer pairs and domain-specific reward engineering. We introduce SPIRAL, a self-play framework where models learn by playing multi-turn, zero-sum games against continuously improving versions of themselves, eliminating the need for human supervision. Through self-play, SPIRAL generates an infinite curriculum of progressively challenging problems, as models must constantly adapt to stronger opponents. To enable this self-play training at scale, we implement a fully online, multi-turn, multi-agent reinforcement learning system for LLMs and propose role-conditioned advantage estimation (RAE) to stabilize multi-agent training. With SPIRAL, self-play on zero-sum games produces reasoning capabilities that transfer broadly. Training Qwen3-4B-Base on Kuhn Poker alone yields an 8.6% improvement on math and 8.4% on general reasoning, outperforming SFT on 25,000 expert game trajectories. Analysis reveals that this transfer occurs through three cognitive patterns: systematic decomposition, expected-value calculation, and case-by-case analysis. Multi-game training (TicTacToe, Kuhn Poker, Simple Negotiation) further enhances performance, as each game develops distinct reasoning strengths. Applying SPIRAL to a strong reasoning model (DeepSeek-R1-Distill-Qwen-7B) still yields a 2.0% average improvement. These results demonstrate that zero-sum games naturally develop transferable reasoning capabilities, highlighting a promising direction for autonomous reasoning development.
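The core idea behind role-conditioned advantage estimation can be illustrated with a minimal sketch: keep a separate return baseline per player role so that each role's advantage is measured against opponents of its own kind. All names here are illustrative assumptions, not the paper's actual code; the paper's RAE may differ in how the baseline is estimated.

```python
from collections import defaultdict

class RoleConditionedAdvantage:
    """Toy sketch of a role-conditioned baseline: one exponential moving
    average of returns per role, with advantage = return - role baseline."""

    def __init__(self, decay=0.95):
        self.decay = decay
        self.baseline = defaultdict(float)  # role -> EMA of returns

    def update(self, role, ret):
        # Update this role's baseline, then return the centered advantage.
        b = self.baseline[role]
        self.baseline[role] = self.decay * b + (1 - self.decay) * ret
        return ret - self.baseline[role]

rae = RoleConditionedAdvantage()
adv = rae.update(role=0, ret=1.0)  # first player won this zero-sum game
```

The point of conditioning on role is that in asymmetric games (e.g. first vs. second player in Kuhn Poker) the two seats have different expected returns, so a shared baseline would systematically bias one side's gradient.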


Search-contempt: a hybrid MCTS algorithm for training AlphaZero-like engines with better computational efficiency

Joshi, Ameya

arXiv.org Artificial Intelligence

AlphaZero in 2017 was able to master chess and other games without human knowledge by playing millions of games against itself (self-play), with a computation budget running into the tens of millions of dollars. It used a variant of the Monte Carlo Tree Search (MCTS) algorithm known as PUCT. This paper introduces search-contempt, a novel hybrid variant of the MCTS algorithm that fundamentally alters the distribution of positions generated in self-play, preferring more challenging positions. In addition, search-contempt has been shown to give a big boost in strength for engines in Odds Chess (where one side receives an unfavorable position from the start). More significantly, it opens up the possibility of training a self-play-based engine far more computationally efficiently, with the number of training games running into the hundreds of thousands and costing tens of thousands of dollars, instead of the tens of millions of training games costing millions of dollars that AlphaZero required. This means it may finally be possible to train such a program from zero on a standard consumer GPU, even with a very limited compute, cost, or time budget.
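For context, the PUCT rule that search-contempt modifies selects the child maximizing Q(s,a) + c · P(s,a) · √N(s) / (1 + N(s,a)). A minimal sketch of that base selection rule, with illustrative data structures (the abstract does not give search-contempt's exact formula, so only standard PUCT is shown):

```python
import math

def puct_select(children, c_puct=1.5):
    """Return the index of the child maximizing the PUCT score.
    Each child is a dict with prior P, visit count N, and total value W."""
    n_parent = sum(ch["N"] for ch in children)
    def score(ch):
        q = ch["W"] / ch["N"] if ch["N"] > 0 else 0.0       # mean value
        u = c_puct * ch["P"] * math.sqrt(n_parent + 1) / (1 + ch["N"])
        return q + u                                         # exploit + explore
    return max(range(len(children)), key=lambda i: score(children[i]))
```

An unvisited child with a reasonable prior gets a large exploration bonus, which is what drives MCTS to cover the move tree before committing.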


Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction

Jin, Yonggang, Zhang, Ge, Zhao, Hao, Zheng, Tianyu, Guo, Jiawei, Xiang, Liuyu, Yue, Shawn, Huang, Stephen W., Chen, Wenhu, He, Zhaofeng, Fu, Jie

arXiv.org Artificial Intelligence

Developing a generalist agent is a longstanding objective in artificial intelligence. Previous efforts that utilize extensive offline datasets from various tasks demonstrate remarkable multitasking performance in reinforcement learning. However, these works struggle to extend their capabilities to new tasks. Recent approaches integrate textual guidance or visual trajectories into decision networks to provide task-specific contextual cues, a promising direction. However, relying solely on textual guidance or visual trajectories is insufficient to accurately convey the contextual information of a task. This paper explores enhanced forms of task guidance for agents, enabling them to comprehend gameplay instructions and thereby facilitating a "read-to-play" capability. Drawing inspiration from the success of multimodal instruction tuning in visual tasks, we treat the visual-based RL task as a long-horizon vision task and construct a set of multimodal game instructions to incorporate instruction tuning into a decision transformer. Experimental results demonstrate that incorporating multimodal game instructions significantly enhances the decision transformer's multitasking and generalization capabilities.
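A decision transformer consumes an interleaved sequence of (return-to-go, state, action) tokens; instruction conditioning amounts to prepending an instruction prefix to that sequence. The sketch below is a hypothetical illustration of this input layout, not the paper's implementation:

```python
def build_sequence(instruction_tokens, rtgs, states, actions):
    """Prepend a (possibly multimodal) instruction prefix, then interleave
    return-to-go, state, and action tokens, as in a decision transformer.
    Tokens are modeled as (kind, value) tuples for illustration."""
    seq = list(instruction_tokens)                 # instruction prefix
    for r, s, a in zip(rtgs, states, actions):
        seq += [("rtg", r), ("state", s), ("action", a)]
    return seq

seq = build_sequence([("instr", "collect coins")], [5.0], ["frame0"], ["jump"])
```

At inference time the model would attend over the instruction prefix when predicting the next action token, which is how task context reaches the policy.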


Enabling A Network AI Gym for Autonomous Cyber Agents

Li, Li, Rami, Jean-Pierre S. El, Taylor, Adrian, Rao, James Hailing, Kunz, Thomas

arXiv.org Artificial Intelligence

This work aims to enable autonomous agents for network cyber operations (CyOps) by applying reinforcement and deep reinforcement learning (RL/DRL). The required RL training environment is particularly challenging, as it must balance the need for high fidelity, best achieved through real network emulation, with the need to run large numbers of training episodes, best achieved using simulation. A unified training environment, the Cyber Gym for Intelligent Learning (CyGIL), is developed in which an emulated CyGIL-E automatically generates a simulated CyGIL-S. Preliminary experimental results show that CyGIL-S can train agents in minutes, compared with the days required in CyGIL-E. The agents trained in CyGIL-S transfer directly to CyGIL-E, showing full decision proficiency in the emulated "real" network. By enabling offline RL, the CyGIL solution presents a promising sim-to-real direction for leveraging RL agents in real-world cyber networks.


Real or Fake Text? We Can Learn to Spot the Difference

#artificialintelligence

The most recent generation of chatbots has surfaced longstanding concerns about the growing sophistication and accessibility of artificial intelligence. Fears about the integrity of the job market -- from the creative economy to the managerial class -- have spread to the classroom as educators rethink learning in the wake of ChatGPT. Yet while apprehensions about employment and schools dominate headlines, the truth is that the effects of large-scale language models such as ChatGPT will touch virtually every corner of our lives. These new tools raise society-wide concerns about artificial intelligence's role in reinforcing social biases, committing fraud and identity theft, generating fake news, spreading misinformation and more. A team of researchers at the University of Pennsylvania School of Engineering and Applied Science is seeking to empower tech users to mitigate these risks.


Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games

Chaudhury, Subhajit, Kimura, Daiki, Talamadupula, Kartik, Tatsubori, Michiaki, Munawar, Asim, Tachibana, Ryuki

arXiv.org Machine Learning

We show that Reinforcement Learning (RL) methods for solving Text-Based Games (TBGs) often fail to generalize to unseen games, especially in small-data regimes. To address this issue, we propose Context Relevant Episodic State Truncation (CREST), which removes irrelevant tokens from observation text for improved generalization. Our method first trains a base model using Q-learning, which typically overfits the training games. The base model's action token distribution is then used to prune observations, removing irrelevant tokens. A second, bootstrapped model is retrained on the pruned observation text. Our bootstrapped agent shows improved generalization in solving unseen TextWorld games, using 10x-20x fewer training games than previous state-of-the-art methods while also requiring fewer training episodes.
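The pruning step can be sketched as a simple filter: keep only the observation tokens that score highly under the base model's action-token statistics. The scoring dictionary below is a hypothetical stand-in for however CREST actually derives token relevance from the Q-learning policy:

```python
def prune_observation(tokens, token_relevance, threshold=0.1):
    """Drop observation tokens whose relevance score (a stand-in for the
    base model's action-token distribution) falls below a threshold."""
    return [t for t in tokens if token_relevance.get(t, 0.0) >= threshold]

relevance = {"north": 0.3, "door": 0.2, "go": 0.05}  # illustrative scores
pruned = prune_observation(["go", "north", "the", "ornate", "door"], relevance)
```

The bootstrapped agent then trains on `pruned` rather than the raw observation, so spurious tokens cannot be memorized.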


A Tic Tac Toe AI with Neural Networks and Machine Learning

#artificialintelligence

This article is my entry for CodeProject's AI competition "Image Classification Challenge". My goal was to teach a neural network to play a game of tic tac toe, starting from only knowing the rules. Tic tac toe is a solved game: a perfect strategy exists, so a neural network is a bit overkill and will not perform as well as existing programs and humans can. Described at a high level: when the AI needs to make a move, it iterates over all possible moves, generates the board that results from each move, and uses the neural network to evaluate how good the position is after that move.
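That move-selection loop is straightforward to write down. In this sketch the board is a 9-character string and `evaluate` is a stand-in for the article's neural network (any function scoring a board from the mover's perspective):

```python
def best_move(board, player, evaluate):
    """Try every legal move, score the resulting board with `evaluate`
    (a stand-in for the neural network), and return the best cell index.
    `board` is a 9-char string of 'X', 'O', or ' '."""
    best, best_score = None, float("-inf")
    for i, cell in enumerate(board):
        if cell == " ":                                   # legal move
            candidate = board[:i] + player + board[i + 1:]
            score = evaluate(candidate, player)
            if score > best_score:
                best, best_score = i, score
    return best
```

Swapping `evaluate` between a trained network and a hand-written heuristic changes nothing else in the loop, which is the appeal of this one-ply search design.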


Race for the Galaxy AI

#artificialintelligence

What makes a game replayable over time? It offers new challenges over and over again. One way to do that is to include an AI opponent so skilled that even advanced players will continue to be challenged after hundreds of hours of play. Race has been one of the top-selling board games this year, partly because of the neural network that powers its AI. Race for the Galaxy uses a temporal difference neural network.
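Temporal difference learning updates a position's value toward the value of the position that follows it. A tabular TD(0) sketch illustrates the update the game's network approximates (the dictionary `V` is an illustrative stand-in for the value network):

```python
def td_update(V, s, s_next, reward, alpha=0.1, gamma=1.0):
    """One TD(0) backup: nudge V(s) toward reward + gamma * V(s_next).
    V is a dict mapping states to estimated values."""
    target = reward + gamma * V.get(s_next, 0.0)
    v = V.get(s, 0.0)
    V[s] = v + alpha * (target - v)
    return V[s]
```

Played over many self-play games, these backups propagate end-of-game outcomes to earlier positions, which is how a TD-trained evaluator learns without hand-coded strategy.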


Brain training games may be a waste of time, scientists say

Daily Mail - Science & tech

Trendy brain training computer games may be a waste of time and money, scientists have revealed. They said that while people may get better at the exercises they practise, there is little or no evidence this helps them in their day-to-day lives. Numerous companies sell packages of games, puzzles and exercises designed to improve memory, boost attention span or simply keep the mind sharp into old age. Researchers examined more than 130 studies into brain training.


Pairwise Relative Offset Features for Atari 2600 Games

Talvitie, Erik (Franklin and Marshall College) | Bowling, Michael (University of Alberta)

AAAI Conferences

We introduce a novel feature set for reinforcement learning in visual domains (e.g. video games) designed to capture pairwise, position-invariant, spatial relationships between objects on the screen. The feature set is simple to implement and computationally practical, but nevertheless allows for substantial improvement over existing baselines in a wide variety of Atari 2600 games. In the most dramatic results the features allow multiple orders of magnitude improvement in performance.
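Pairwise relative offset features can be sketched directly from the description: for each ordered pair of on-screen objects, emit a binary feature keyed by the two object classes and their (dx, dy) offset, so the feature fires regardless of where on the screen the pair appears. Names and the tuple layout are illustrative assumptions:

```python
def relative_offset_features(objects):
    """Position-invariant pairwise features for detected screen objects.
    `objects` is a list of (class_name, x, y) tuples; the result is the set
    of active (class_i, class_j, dx, dy) features."""
    feats = set()
    for (ci, xi, yi) in objects:
        for (cj, xj, yj) in objects:
            if (ci, xi, yi) != (cj, xj, yj):
                feats.add((ci, cj, xj - xi, yj - yi))  # offset from i to j
    return feats

feats = relative_offset_features([("ship", 2, 3), ("alien", 5, 3)])
```

Because only the offset is recorded, translating the whole scene leaves the active feature set unchanged, which is the position invariance the abstract refers to. (In practice offsets would also be discretized to keep the feature space manageable.)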